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Abstract 

The problem of isolating real roots of a square-free polynomial inside a given interval Jq is a funda¬ 
mental problem. Subdivision based algorithms are a standard approach to solve this problem. Given 
an interval 7, such algorithms rely on two predicates: an exclusion predicate, which if true means I has 
no roots, and an inclusion predicate, which if true, reports an isolated root in I. If neither predicate 
holds, then we subdivide the interval and proceed recursively, starting from Jq. Example algorithms are 
Sturm’s method (predicates based on Sturm sequences), the Descartes method (using Descartes’s rule 
of signs), and Eval (using interval-arithmetic). For the canonical problem of isolating all real roots of 
a degree n polynomial with integer coefficients of bit-length L, the subdivision tree size of (almost all) 
these algorithms is bounded by 0{n{L + logn)). This is known to be optimal for subdivision based 
algorithms. 

We describe a subroutine that improves the running time of any subdivision algorithm for real root 
isolation. The subroutine first detects clusters of roots using a result of Ostrowski, and then uses Newton 
iteration to converge to them. Near a cluster, we switch to subdivision, and proceed recursively. The 
subroutine has the advantage that it is independent of the predicates used to terminate the subdivision. 
This gives us an alternative and simpler approach to recent developments of Sagraloff (2012) and Sagraloff- 
Mehlhorn (2013), assuming exact arithmetic. 

The subdivision tree size of our algorithm using predicates based on Descartes’s rule of signs is 
bounded by O(nlogn), which is better by 0(n log L) compared to known results. Our analysis differs in 
two key aspects. First, we use the general technique of continuous amortization from Burr-Krahmer-Yap 
(2009), and second, we use the geometry of clusters of roots instead of the Davenport-Mahler bound. 
The analysis naturally extends to other predicates. 


1 Introduction 

Given a polynomial / € of degree n, the problem is to isolate the real roots of / in an input interval /q, 
i.e., compute disjoint intervals which contain exactly one real root of /, and together contain all roots of / in 
Iq n M. Subdivision based algorithms have been successful in addressing the problem. A general subdivision 
algorithm uses two predicates, given an interval I: the exclusion predicate Coil), which if true means I has 
no roots; the inclusion predicate, Ciil), which if true means I has exactly one root. The algorithm outputs 
a root-partition V of Iq, i.e., a set of pairwise disjoint open intervals such that for each interval either Cq 
or Cl holds, and Io\V contains no roots of /. To compute isolating intervals for roots of /, check the sign 
of / at the endpoints of the intervals in V (this works if / is square-free). The following generic subdivision 
algorithm constructs a root-partition: 
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Isolate(/o) 

Input: / G M[x] and an interval Jq C M. 

Output: A root-partition V of f in Iq. 

0. Preprocessing step. 

1. Initialize a queue Q with Iq, and V 

2. While Q is not empty 

Remove an interval I = (a, b) from Q. 

If C'o(/) V Ci{I) then add I to V. 

else <1 Subdivide I 
Let m ^ (a + &)/2. 

Push (a, m) and (m, b) into Q. 

3. Output V. 

The algorithm is guaranteed to terminate for square-free polynomials; otherwise we get an infinite sequence 
of intervals converging to a root of multiplicity greater than one. Some standard choices of the predicates 
and the corresponding algorithms are: 

(i) Sturm sequences and Sturm’s method [Dav85| . 

(ii) Descartes’s rule of signs and the Descartes method |CA76j . 

(iii) Interval-arithmetic based approaches and Eval [BKY09) . 

The complexity of these algorithms is well understood for the benchmark problem of isolating all real roots 
of a square-free integer polynomial with coefficients of bit-length L. One measure of complexity is the size of 
the subdivision tree constructed by the algorithm. For the first two algorithms a bound of 0{n{L + logn)) 
was shown in [Dav85| and |ESY06) . respectively. For Eval a weaker bound of 0{n{L + n)) was established 
in |SYI2) . It is also known |ESY06] that the bound 0{n{L + logn)) is essentially tight for any algorithm 
doing uniform subdivision, i.e., reduces the width at every step by some constant (in our case by half). 

Uniform subdivision cannot improve on the bound mentioned above because it only gives linear conver¬ 
gence to a “root cluster”, i.e., roots which are relatively closer to each other than to any other root. But it is 
known that from points sufficiently far away from the cluster, Newton iteration (more precisely, its variants 
for multiple roots) converges quadratically to the cluster. This has been an underlying idea in improving 
the linear convergence of subdivision algorithms for root isolation [PanOOl |SagI2[ ISM13| , and has also been 
combined with homotopy based approaches [YakOOl IST09j . 

We follow the same idea with some key differences. Given Co and Ci, our algorithm can be described as 
follows (we only give the inner loop, see Section]^ for complete details): 

Newton-Isol(/o) 

If Co{I) V Cl (I) then add / to V. 
else if a cluster C of roots is detected in / then 
Apply Newton iteration to approximate C 
while quadratic convergence holds. 

Estimate an interval J containing C. 

Push J into Q. 
else < Subdivide I 


For detecting root clusters, we use a result of Ostrowski based on Newton diagram of a polynomial |Ost40) : 
other choices are based on a generalization of Smale’s a-theory (see |GLSY0!^ and the references therein); 
the details can be found in Section]^ These tools and approaches have been used earlier [PanOO] . however, 
our approach has the following differences: 

(i) The tools used to detect and estimate the size of a cluster are independent of the particular choice of 
the exclusion-inclusion predicates (cf. |SagI2| ). This way we obtain a general approach to improve any 
subdivision algorithm. 

(ii) Another difference is the method that is combined with bisection to improve convergence. In |Sagl2| 
Abbott’s QIR method is combined with the Schroder operator [GLSY05], whereas we apply standard 
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Newton iteration to a suitable derivative of /. The former combination is a backtracking approach to 
get quadratic convergence; the latter gives quadratic convergence right away (but perhaps increasing 
subdivisions). This has the advantage of separating the Newton iteration steps from the subdivision 
tree, which is reflected in the bounds on the subdivision tree size for the two approaches: for the former 
we have 0(nlog(nL)), and for the latter we have 0(n logn). The number of quadratically converging 
steps remains the same in both cases. 

(iii) Our approach can be modified to isolate complex roots; replace binary subdivision with a quad-tree 
subdivision, and choose appropriate predicates (e.g., Ostrowski’s result mentioned above, or Pellet’s 
test). This avoids Graeffe iteration (cf. |PanOO| b and yet the modified algorithm can be shown to 
attain a near optimal bound on subdivision tree size. 


In this paper, we focus on bounding the size of subdivision tree of Newton-Isol. For this purpose, we 
use the general framework of continuous amortization |BKY09l IBurl3) . The key idea here is to bound the 
tree size by G{x)dx, where G is a suitable “charging” function corresponding to the predicates used in 
the algorithm (e.g., see IBurlSp . Our key contributions are the following: 

(i) We derive a near optimal bound of 0(n log n) on the size of the subdivision tree of Newton-Isol when 
Go, Gi are based on Descartes’s rule of signs (see Theorem 101. This is the first application of the 
continuous amortization framework to a non-uniform subdivision algorithm. 

(ii) We show that if the distance of the cluster center to the nearest root outside the cluster exceeds roughly 

times the diameter of the cluster, then Ostrowski’s criterion for cluster detection works, and we 
obtain quadratic convergence to the cluster center (see Lemma [^. 

(iii) Our analysis crucially uses the cluster tree of the polynomial (see Proposition]^. We derive an integral 
bound on the size of the subdivision tree (see Theorem]^. The usual approach to upper bound this 
integral is to break it over the (real) Voronoi regions of the roots |Burl3| . We instead break the 
integral over the Voronoi regions corresponding to the clusters in an inductive manner based on the 
cluster tree. The integral over the portion of the region outside the cluster is bounded using known 
techniques. However, for the portion inside the cluster, we devise an amortized bound on the integral 
(see Lemma |12[ ), which is of independent interest, and is analogous to the improvement given by 
Davenport-Mahler bound over repeated applications of the root separation bound. It is this result that 
underlies the O(nlogn) bound. A simple argument extends these bounds to Sturm’s method and the 
Eval algorithm. The details are in Section]^ 


2 Notation and Basic Results 

Let / G K[a:] be a square-free polynomial of degree n > 2 and Z{f) C C be its set of roots. Given a 
finite pointset S' C let Ds be the disc D{ms,rs) such that ms is the centroid of the points in S, and 
rs is the least radius such that all the points in S are contained in D(ms,rs)- Given a A G R >07 define 
\Ds := D{ms, Ars). We borrow the following definition from [SSY13] : A subset C C Z{f) of size at least two 
is called a (root) cluster if the only roots in 3Dc are from C. We treat individual roots as (trivial) clusters. 
In this paper, the non-real roots in C come in conjugate pairs. Therefore, the center of Dc will always be in 
M. Define Rc as the distance from me to the nearest point in the set Z{f) \ C. From the definition it follows 
that Z{f) trivially forms a cluster and Rz{f) = oo. Given an interval I, let m{I) denote its midpoint and 
w{I) its width. We will often use the shorthand I = [m{I) ± w(/)/2], and for A > 0, A/ := [m(/) ± \w[I)/2]. 
An interval I contains a cluster C if C C D(m{I),w{I)/2). 

We use the following convenient notation in the subsequent definitions: for a:, y G K, ‘x ^ y’ if there is a 
constant c > 1 such that x > cy\ similarly define x <C y. 

A strongly-separated cluster (ssc) is a cluster C for which Rc/rc ^ n^', the exact constant can be 
found in Gorollary]^ For a ssc C, define the following three quantities: 

(i) The interval Iq := [me ± c • fcre], for some constant c > 1. 

(ii) The interval Zc := {x : |x — mc| <C Re/n^'\- 

(iii) The annulus Ae '■='Ic \ /c = {-^ G C : |C|rc <C 1^ — mc| Re/ri^]- 

The exact constants in these definitions are given in LemmaSee Figure]^ for an illustration of these 
concepts. If C is not a ssc, then we define le '■= [me ± re] and Xc := 2/c• Note that for all clusters C, le QZe- 
We will need the following result later in our analysis |SSY131 Lemma 2.1]: 
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Proposition 1. Given a root cluster C of f. There is a unique unordered tree Tq rooted at C whose set of 
nodes are the clusters contained in C, and the parent-child relation is subset inclusion. Let Tf be the tree 
where the parent is the cluster Z{f) of all roots. 

The result originally is stated for root clusters of / G C[a;]. However, for / G K[a;] the clusters come in 
conjugate pairs, and by taking the union of such pairs the result still holds. The tree Tc is called the cluster 
tree of C. The leaves of this tree are the roots in C. 
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Figure 1: Geometry of a ssc C. We focus on the the relative geometry, overlooking the exact constants 
involved in the definition of the intervals. 


2.1 Cluster Detection and Approximation 

The literature on detection and approximation of root clusters is vast (see |GLSY05] and the references 
therein). One approach is based on Pellet’s test: if for a complex polynomial f{x) = there is an 

r > 0 such that |afc|r^' > then the disc D{0, r) contains exactly k roots of /. A point z G C is said 

to satisfy Pellet’s test, if there is a fc and r for which the test holds with the coefficients of f{x-\-z). Results 
in [YakOOl [GLSY05| generalize Smale’s a-theory and relate it to Pellet’s test; an alternative derivation based 
on tropical algebra is given in [Shall) . We instead use a result by Ostrowski [OstlO] . 

We need the following definitions. Let f{x) = where Oi G C. With each index z, m ^ 0, 

associate the point Pi := {i, — log |ai|) G The lower-hull of the convex-hull of these points is called the 
Newton diagram of /. Given an index k G {0, ...,n}, let yk be the point such that {k,yk) is on the 
diagram. Define := for 1 < A: < n, Pn+i:=oo, and the A:th deviation Aj, := pfe+i/p^., for 

0 < A: < n. 

Let oi,..., On G C be the roots of / ordered such that |ai| < |q: 2 | < • • • < \an\. Ostrowski showed the 
following fundamental relation between the absolute values of the roots and pk ’s |()st4ni p. 143]: 

— <2(n-fc + l). (1) 

2k Pk 


Given z G C, we will be interested in the Newton diagram of f{x z). If fj{z) := f^^\z)/j\, then from a 
result of Ostrowski |Ost401 p. 128] we get: 


Pfe(z) = max 
j<k 


fjjz) 

fk{z) 


1 

(fc-j) 


and pk+i{z) = min 

j>k 


fkjz) 

/j(^) 


1 


(j-fc) 


( 2 ) 


The RHS is defined for any k such that fk{z) ^ 0; however, we are only interested in those k for which Pk 
is on the diagram. The A:th deviation Ak{z) := pk-ki{z)/Pk{z). We have the following result for detecting 
clusters: 


Lemma 2. If Ak{z) > 27, for some index 0 < k < n, then there are exactly k roots in D{z,3pk{z)) and 
Zl(z,pfc+i(z)/3). Moreover, as pfc+i(z)/3 > 9pfc(z), these roots form a cluster. 

The proof shows that the inequality Ak{z) > 27 implies that Pellet’s test holds for D{z, r), 3pk{z) < r < 
Zl(z,pfc+i(z)/3) (see [GLSYOSl Thm. 1.5]). Since the Pfs are sorted by x-coordinate, all the pkS can be 
computed in 0(n) steps using, e.g., Graham’s scan for convex hull computation. 

Once we have detected a cluster C near z, we want a good approximation to me- A standard way is to 
do the iteration z^+i = Zi — kf{zi)/f'{zi), starting from z, but this may not be numerically desirable, as 


4 

































both / and /' are small near C. Another option is to use the standard Newton iteration applied to 
We show that if Ak{z) > 27, then z is an approximate zero, in the sense of Smale et al. |BCSS98[ p. 160, 
Thm. 2], to the root of in D{z, )• Subsequently we show that if Aj,(z) > cg, for some constant 

Co > 27, then for all z' in this disc Afc(z') > 27, and hence there is a cluster of k roots in D{z',3pk{z')). 
Moreover, the cluster is exactly C. These results are summarized in the following: 

Lemma 3. Let z G C be such that Afc(z) > Cq, for some k > 2, C be the cluster in D{z,3pk{z)), and 
D' :=D{z, ) • Then the following hold: 

(i) z is an approximate zero to the root z* of in D' and the Newton iterates starting from z are in 

D'. 

(a) For all z' G D', Afc(z') > 27, and C is the cluster in D{z',3pk{z')). 

(Hi) If z,w are such that Afc(z), Afc(ty) > 27 and D(z,3pk{z)), D{w,3pk{w)) intersect then the discs have 
the same cluster. 

The proof is given in the appendix. We choose Cg :=27 x 6e®. Given z S C, a value of k satisfying 
the condition Afc(z) > cg is called an admissible value for z, with the corresponding inclusion disc 
D{z, 3pk{z)). Note that there can be more than one admissible value for a point z corresponding to clusters 
of different sizes. 


3 The Algorithm 

Let Cg and Ci be some exclusion and inclusion predicate respectively. The following algorithm takes as 
input / and an interval /g and outputs a root partition of Jg. 


Newton-Isol(/, /g) 

1 Initialize 7^ ^ 0, <i> ^ 0; let Q be an empty queue. 

1. a. If this is a recursive call then subdivide /g and 

push the two halves into Q\ else Q G- {/g}. 

2. While Q is not empty do 

Remove an interval / from Q. 

2.a. If C'g(J) V C'i(J) then add I to V. 

else if Newton-Incl-Exc(/) is successful then 
Let (J, k) be the pair returned. 

2.b. If VJ' S $, J n J' = 0 and J H /g ^ 0 then 

2.C. V r G g, r ^ r \ l>(to(j), 

2. d. Add J n /g to <i>. 

else subdivide I and push the two halves into Q. 

3. Return 7^ Ujg$ Newton-Isol(/, J). 

The input to Newton-Incl-Exc is an interval / = {a,b). If the predicate is successful then it returns an 
interval J containing a cluster such that w{J) < w(/)/2, and an admissible value k for m(J); otherwise it 
returns failure. 
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Newton-Incl-Exc (/,/) 

1. 

Let m:= (a -1- b)/2. 

2. 

For p G {a, m, 6}, let fcp > 2 be the smallest admissible 
value k for p such that I G D (p, EMilE)'j ^ 

3. 

If the three admissible values are equal and the three 
inclusion discs are contained in D (jn, then: 

3.a. 

Zo := m, k := k^, 9 ■= i ■= 0. 

4. 

While pk{zi) < 2^-'^'pk{zo) 

Zi+i := Zi - g{zi)/g'{zi)\ i:=i + l. 

J := [zi_i ± 3pfc(zi_i)] 

5. 

If w{J) > w{I)/2 then return failure 

6. 

else return {J,k). 

7. 

Return failure. 


We first explain some steps in the predicate above: 

Step 2. A point pin I can have more than one admissible value associated with it. The right admissible 
value is governed by w{I), since we should only consider those clusters C for which rc ^ w{I) <C Rc- 
Step 3. As D{m, /i) contains all the three inclusion discs, they all contain the same cluster 

C. Otherwise, it is possible that the three inclusion discs contain different clusters but of the same size. 
Step 4. This ensures that as Zi converges to the root of the distance to C decreases quadrati- 

cally; this fails when we are near C, or the root of is not near C. 

Step 5. Required to ensure linear convergence to C. 

Step 6. The interval J contains the cluster C. Moreover, as I C D{m, pk+i{m)/3), we know that if 
the roots in I are a subset of C, and hence are inside J. By now w{J) < w(/)/2, therefore, it suffices 
to return J. 

We now comment on some steps in Newton-Isol: 

Step l.a. Ensures that a successful call to Newton-Incl-Exc is followed by a subdivision step. Thus 
the recursion tree is a binary tree. The predicate can still be successful on an interval J returned 
by an earlier successful call. But the convergence in this case would only be linear, and so we prefer 
subdivision, though in practice one can go ahead with the linear convergence. 

Step 2.b. Checks if C has not been found before (see Lemma|^iii)), and that J is inside /q; if either 
of this test fails, then I contains no roots and can be excluded. 

Step 2.C. As the only cluster in D(rn{ J), Ph±lipiDl'^ jg remove this disc from the intervals 

in Q. It is this exclusion step that significantly contributes to the improvement of the subdivision 
algorithm. 

Step 2.d. This step adds the interval J n /q containing the newly discovered cluster C to the set $. 
There are only two loops in the algorithm: first, the while-loop in step 2 of the algorithm, and second, the 
Newton iteration in step 4 of Newton-Incl-Exc. The argument for the termination of the first loop is the 
same as for Isolate. The termination of the second loop is guaranteed, because if Zi’s are such that pk{zi) 
keeps on decreasing, then in the limit pfc’s converge to zero; but the disc D{zi,3pk{zi)) contains exactly k 
roots; since, in the limit x^’s tend to a root z* of this implies that z* is a fc-fold root of /, which is a 

contradiction as / is square-free. 

The following is a proof of correctness of the algorithm. 

Theorem 4. Given a polynomial f and an interval /q, Newton-Isol{f, Iq) outputs a root partition V of Iq. 

Proof. We need to show the following claims: 

1. IqXV contains no real roots of /. 

2. V contains (interior) pairwise disjoint intervals. 

3. For all I gV, Cq or Ci holds (follows from step 2.a.). 

Lemma|^gives us the correctness of Newton-Incl-Exc(/), i.e., if the test is successful then it returns an 
interval J such that any roots in I are contained in J. We only argue for the first claim. For every interval 


6 






J returned by a successful call of the predicate, define 

Aj :=£) (TO(J),pfc+i(m(J))/3) \ D{m{J),w{J)/2), (3) 

i.e., the annulus around J that does not contain any roots. We exclude intervals if step (2.b) fails for the 
interval J, or a portion of an interval is removed in step (2.c.). In the former case, either the cluster contained 
in J was already detected, or it is outside /q. In the latter case, we do not loose any roots since Aj has no 
roots. So /g \ "P contains no roots. Q.E.D. 


4 Complexity Analysis 

The main result is that Newton-Incl-Exc will be successful near a ssc C. Let cg > 20 be the constant in 
Lemma 1^ and C a ssc throughout this section. Our first claim is that \C\ is an admissible value for all points 
in Tc- 

Lemma 5. If \z — mc\ < Rc / then Ak{z) > cq . 

Proof. Let ai,...,ak € C and ak+i, ■ ■ ■ ,an G Z{f) \C. Moreover, assume that they are ordered in 
increasing distance from z. From Q, we know that 2k\z — ak+i\ > Pk+i{z) > \z — ak+i\/{2{n — k + I)). 
Moreover, \z — ak+i \ > Rc — \z — mc \ > Rcl‘^\ similarly, |2 — ak+i\ < 3Rc/2. Therefore, 

^<Pfc+i(^)<3|C|i?c. (4) 

From Q, we again have Pk{z) < 2k\z — ak\. But \z — ak\ < \z — mc \ + rc, which gives us 

Pkiz) <2k(\z-mc\+rc). (5) 

Since \z — mc\,rc < Rc/{8con'^), we get Pk{z) < /ci?c/(2cgn^). Combining this with Q, and the observation 
that (n — k)k < n^/4, we obtain that > 2con^/(8(n — k)k) > cg. Q.E.D. 

Recall the definition of the intervals Ic , Rc and the annulus Ac from Section and Aj from (|^. 

Lemma 6. If an interval I is such that 

/ C Ic = [me ± i?c/(8cgn^)] and w{I) > 72|C|rc 

then the pair {J,k) returned by Newton-Incl-Exc{I) is such that k = \C\, J ^ Ic = [me ± 20krc], and 
Aj A Ac- 

Proof. We show that the conditions on / above imply that Newton-Incl-Exc(/) reaches step 6 of 
Newton-Incl-Exc (all the steps below refer to the steps in the predicate). This requires showing the fol¬ 
lowing: (i) all the conditions in step 3 are met; (ii) Newton-iteration in step 4 converges quadratically 
terminating with an interval J with w{J) < w{I)/2, and (hi) J Q Ic- The following claims provide the 
proof. Let / = [a, b] and m = m{I)- 

Claim 1: For all p G {a,m,b}, kp = \C\- Recall from Step 2 that kp is defined as the smallest admis¬ 
sible value k for which I C D{p, pk+i{p)/^)- From Lemmaj^ we have kp < \C\- Since the roots in / 
can only come from C, any smaller admissible value corresponds to a subcluster C of C, which implies 
Rc "Zi fc- From 0 we know that p\c'\+i(j>) A 3(|C'| -I- l)i?c' A 3|C|rc. Since w{I) > 72|C|rc, clearly 
/ 2 D{p, P\C'\+i{p)/^) for any subcluster C' C C. Thus kp > \C\- 

Claim 2: For all p G /, / C D{p, pk+i{p)/5)- This will follow from the more general claim that 

Di :=D(mc,i?c/(8cgn^)) C D{z, p\c\+i{z)/3) =: D 2 , 

for all z G Di] since a,m,n G I 'G Di, the claim holds. But for any z G Di, we know from Q that 
^ which is greater than the diameter of Di, for cg > 3. 
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Claim 3: For all z,w G -Di, the inclusion disc D{z,3pk{z)) C D{w, This follows if 

\z-w\ +3pfc(z) < (6) 

But \z — w\,Tc < RcK^cqtt?), which along with ([^ implies that 3pk{z) < 6fci?c/(4con^). Therefore, 
LHS of ([^ is smaller than 13kRc/{8con^), which is smaller than Rc/{12n) for cq > 20, but from Q 
we know that the latter is smaller than the RHS of (§. 

Claim 4: Let Zi be the sequence of iterates computed in the while-loop in Step 4. If Zi S D{mc, )\ 

D{mc,2rc), then pk{zi) < Since Zi ^ D{mc,2rc), rc < \zi — mc|, and hence from ([Sj) we 

obtain pk{zi) < 4fc|zi — mc\- From |Paw99l Thm. 2.2] we know that there is a unique root z* of 
in D{mc,rc)- Therefore, jz^ —mc| < jz^ —z*|-|-rc. But as Zi ^ D(me, 2rc) and z* G H(me, rc), we have 
T'C < jzi —z* I, and hence IZj —me I <2|zi —z*|. Thus, Pk{zi) <8fc|zi —z*|. As zq is an approximate zero 
to z* (see Lemma|^i)), we know jz—z*| < 2^“^*|zo — z:*|, which implies that pk{zi) < 2^“^’fc|zo — z*|. 
Furthermore, from Lemmaj^i) we know fcjzo — z*| < 2pk{zo). Hence pk{zi) < 2®“^ Pk{zo). 

Claim 5: The interval J C Iq and w{J) < w{I)/2. The previous claim shows that if Zi ^ D{mc, 2rc), 
then we will obtain quadratically decreasing values of pk{zi). Thus when the iteration stops Zi G 
D{mc,2rc), and it follows from ([^ that pk{zi) < Gkrc- Hence the interval J = z^ ± 3pk{zi) is 
contained in Ic, for k>2. Moreover, w{J) < ZQkrc < w{I)/2, and hence the condition in Step 5 fails 
and we return J. The claim on the annulus follows from Q. 

Q.E.D. 

The following result translates the result above in terms of the subdivision tree: 

Corollary 7. Let C be a ssc such that He Q lo- If I is the first interval such that Newton-Incl-Exc{I) is 
successful and the interval returned contains C, then Ic C /' U I", where I' is the parent-interval of I and 
I” is one of I' ’s neighbors. 

Proof. In the worst case, C will be detected the first time in the subdivision tree an interval / C Ic. For 
such an I, we show w{I) 3> krc. Since I is the first interval to fall in Ic, both I' and I” have endpoints 
outside Ic, thus Ic C /' U So 2w{I) > Rc /(16con^) > 72krc, as C is ssc. The claim clearly holds if C is 
detected at an ancestor of I. Q.E.D. 

Remark: The proof above gives us the explicit constant in the definition of ssc, namely, we require 
Rcirc > 16co X 72n^. A careful working out of the proofs shows that the weaker inequality Rc/rc > 
4co X 72{n — |C|)|Cp, (or even 50con^) is sufficient. 

Recall that the set of all roots Z{f) is a cluster. As a consequence of Lemma we assume that 
/q C nlz(f)', otherwise Newton-Incl-Exc will be successful right away and the interval returned will satisfy 
the property. 

4.1 An Integral Bound on the Subdivision Tree 

Let JV{Io) be the set of leaves in the subdivision tree of Newton-Isol(/, Iq). Step l.a. of the algorithm 
ensures that the subdivision tree is a binary tree. Therefore, it suffices to bound |A/'(/o)l- For this purpose, 
we use the general framework of continuous amortization develooed in |BKY09| and eeneralized in [Bur m- 
The idea is to bound |A/'(/o)| by an integral and then derive an upper bound on this integral. For this 
purpose we need the following notion: Given a choice of predicates Cq, Ci, a function G : K —)■ K>o is called 
a stopping function corresponding to Gq and Gi if for every interval I, if there is an a: G / such that 
w{I)G{x) < 1, then either Co{I) or Gi(/) holds. Stopping functions, corresponding to different predicates, 
are provided in |Burl3j . The crucial property of G{x) is the following: 

Lemma 8. If Gq{I) and Gi(/) fail for an interval I, then for all J Q I, such that 2w{J) > w{I), 
2 f j G{x)dx > 1. 













Proof. From the definition of G{x), we have for all x G I, G{x)w{I) >1. As J C /, Vx e J, 2G{x)w{J) > 
G{x)w{I) > 1. Thus 2 JjG{x)dx > 2w{J) mhixej G{x) > 1. Q.E.D. 

The main result of this section is the following: 

Theorem 9. 

|A/'(/o)| < 4n + 2 f G{x)dx, 

J /o\Uc-4.c 

where the union is over all ssc C in Tj. 

We bound Af{Io) recursively. The leaves in Af{Io) correspond to three types of intervals: 

(i) intervals in the root partition V, 

(ii) intervals that were discarded in step 2.c., and 

(iii) intervals for which condition 2.b fails to hold (either cluster already found, or J n Jq = 0). 

We will bound each of these three types. We analyse what happens before the first set of recursive calls. 

Let $ be the set of intervals collected in Step 2.d. of the algorithm, Aj be as defined in ([^, and 
Ij := J U Aj. From the construction of $, we know that all intervals J G <f> are contained in Iq and each 
contains a unique cluster. For each J G $, let Lj be the set of parent-intervals of intervals in Q that intersect 
2j; the type (ii) intervals are children of intervals in Lj. Let Mj be the set of intervals that do not intersect 
Ij and are of type (iii). See Figure]^ for an illustration of these types. Note that ii I G Lj contains an 
endpoint of Ij, then I\Tj can be of type (i) or (iii); but there can be at most two such intervals for each J 
on either side of Tj. We abuse notation and use Lj to represent a set as well as the union of the intervals 
in it; same for Mj. 


Mj or P Intervals in Lj Mj or V 

\ / \ \ \ 

Q Q 0 0 0 

/ \ Jiy 'I'A ' 

■I—^- K I T I 4 - ^- H -^-1 ' 

.7 

Ij 

Figure 2: The three types of intervals in Af (Iq). Intervals in Lj are shown in green. The remaining intervals 
could be in Mj or V. The width of the red colored intervals can be much smaller than their parents. But 
there are at most two such intervals. 

For an / G Mj, both Cq and Ci failed. Therefore, from Lemma [^we get \Mj\ < IiGix)dx = 

2 G{x)dx. As the predicates Cq and Ci also fail for the intervals in Lj, we can similarly bound \Lj\. But 
this effectively amounts to doing subdivision on J. To avoid this we do the following: since the width of the 
intervals in Lj is more than w{ J), we know that there are at most two neighboring intervals I'j and If that 
contain J. We count them separately, and for the rest we use Lemmaj^to get \Lj\ < 2 + 2 ^ 7 ") G(x)dx. 

For an interval I G V, we expect 2 J). G(x)dx > 1, as the predicates must have failed for the parent I' of I. 
However, Lemmarequires that w{I') < 2w{I). This can fail to happen near the boundary of Ij, as noted 
earlier. But then there are at most two such intervals. Therefore, the number of intervals in V coming from 
the non-recursive calls is at most 2|$| + 2 G{x)dx. Combining this with the bounds on \Lj\ 
and \Mj\ we get 

|Af(/o)| <4|$|+2 / G(x)dx+^|Af(J)|. (7) 

Vo\Uj(I'jUI’J) 

To open the RHS recursively, we introduce the notion of cluster tree T/^ with respect to an interval Iq: 
It is the smallest subtree Tq of Ty rooted at a cluster C such that Iq <G I^; since by assumption Iq C nlz(f), 
in the worst case, Tig is Tj. Moreover, as enlarging Iq increases the integral in Q, we further make the 
simplifying assumption that Iq = 2Icg , where Cq is the root of Tjg. 
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Let C be the cluster associated with a node u in Tj^ . Let G $ be the interval returned the first time C 
is detected by Newton-Incl-Exc . Define := {I'j^ U /" ) \ J„; if C is not detected, let = Ju = 0. Using 
this notation, the following bound can be derived from ([^ by induction: 


|Af(/o)| <4|r,J+2 / G{x)dx. 


( 8 ) 


For a ssc C G T/^, the assumption /g = 2Icg ensures that Ic '!= lo- So Corollary implies that 1'^ U I" 2 2c; 
and Lemmaimplies that Q Ic] hence, 2 2lc- Considering only the ssc in T/^ on the RHS of Q we 
obtain 


|Af(/o)| <4|T/J+2 / G{x)dx, 

J Io\UcAc 


(9) 


As \Tig \ < n, we get Theorem]^ 


4.2 Bound for the Descartes’s rule of signs 

In this section, we derive the following bound: 


Theorem 10. Given a square-free polynomial f G M[a;] of degree n, the size of the subdivision tree constructed 
by Newton-Isol(f, Iq) using predicates based on the Descartes’s rule of signs is bounded by 0 ( 7 T,lnn). 

We bound the RHS of ([^, where the stopping function corresponds to the Descartes’s rule of signs. We 
use the same stopping function as described in [Bur 13] . but explain why the argument there fails to give us 
the bound above. 

Let V :=Z(f), the set of roots of /. Define d{x,V) as the distance from x to the closest point in V, 
and d 2 {x,V) as the distance to the second closest point in V. The crucial idea in [Bur 13] is to partition 
the integral over the (real) Voronoi interval la of each root a (for the moment suppose a G K). Define 
Ja ■= [a ± Then for x G Ja, G{x) '.= 2/ d 2 {x,V), and for x € Ia\ Ja, G{x) :=l/|a; — a\. Break 

G{x)dx as G{x)dx + G{x)dx. In [Bur 1 3] it is shown that the first integral is 0(1), and the 

second integral is 0{\ogw[Ia)/d 2 {a,V))] from Cauchy’s bound we can assume that w{Ia) = 2 ^^G ^ The 
problem is that in the worst case this ratio can be il{n{L + logn)). E.g., if all the other roots are of the 
form a ± it, for increasing values of t, then is the x-axis. Therefore, fj G{x)dx = U(L — log d 2 {oi, U)); 
in the worst case ^ 2 ( 0 :, V) can be the root separation bound. 

Our idea is based on the observation that roots with very small separation give rise to root clusters. For 
clusters that are not ssc, the ratio Rcjrc = 0{n^), therefore, the number of subdivisions needed to bridge 
this gap is O(logn). For a ssc, the gap is bridged by Newton-Incl-Exc so that the subdivision is restricted 
to the ranges Rc to roughly Rc/n^ and |C|rc to rc, both of which take O(logn) subdivisions. Doing this for 
all clusters basically gives the bound in Theorem [TOj 

Let P C C be a pointset such that any non-real point in P also has its complex conjugate in P. Such a 
set of points is called dense if no proper subset of P forms a non-trivial cluster, i.e., for all S C P, such that 
151 > 2, the disc 3Ps contains a point from P\S. This structure plays a fundamental role in our arguments, 
as do the following two integrals (see [BurI3[[m^ l: 


Lemma 11. Let 7 G C and J = [r, s]. 
(Re) // 7 G K \ J, then 



In 


7 — s 
7 — r 


(5(J>7) 


where 6{J > 7 ) = +1 if r > 
(Im) // 7 G C \ K then 


7 and —1 if s < J. 



< O 


In 


max{|s - 7 I, |r - 7 I} 
|lm( 7 )| 


( 10 ) 


( 11 ) 
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We now give the proof of Theorem [T0| 

Proof. The proof is by induction on We claim that 


'-fo\Uc^C 


G{x)dx = 0(|T/j,I Inn). 


( 12 ) 


Let Co be the root of by assumption we have Iq = 2/c„. Let Ado be the children of Co in T/^. Consider 
a ssc C S Ado- Then /o \ Ale C (Iq \ Xc) U 2Ic. If C is a ssc contained in C, we can inductively remove Alc' 
from Ic- This also works for clusters that are not ssc in Ado, since by definition Xc = 2/c. Therefore, 


/o \ UcAlc C (/g \ UcGWIoIc) U (UcgA^o (2^C \ Uc'ccdlc')) ■ 


We claim that 




G{x)dx = 0(|Ado| Inn). 


As |Tc| < |T/„|, for C e Ado, by induction we obtain 


(13) 


> 2Ic\Jc’ 


G{x)dx = 0(|Tc| Inn). 


This bound along with (13) and the observation that |Ado| + l^cl < l^/ol gives us (12). The base 

case is when Adg contains only leaves, in which case ( [I^ reduces to (13). 

We next claim that 


Go\Uc^Mo^C 


G{x)dx = O(lnn) 


G'g\jC&MgIC 


G{x)da 


where /q := [mco ± 2rco]. If is not a ssc, then this is clear as /(, = 2/co = lo- If Co is a ssc, then 
Io = [wCo±|Cokco]- Break Jo as/ q, [mco+2rco, mco + |Co|rco] and [mco-2rco,TOCo-|Co|rco]. The closest root 
to any x in these intervals is from Cq. Moreover, as \x — mcg\ > ^.rcg, we get G{x) := l/d(x, V) < 2/|a; —rnc„|. 

Therefore, from Lemma IHRe) it follows that Jmc°+ 2 rc^^‘^° \x-mc \ ~ C>(ln|Co|). Similarly for the other 
interval. Hence to prove ( 


T^, it suffices to show 


Go\Uc,=MqIc 


G{x)dx = 0(|Ado| Inn). 


(14) 


Let Ado also denote the pointset obtained by replacing each C S Adg by its center me- We will use 


Lemma [T^ to prove (14). As no subset of Ado forms a cluster, Adg is a dense pointset, and Lemma [T^ is 


applicable. However, we first remove some region around every p £ Adg H M to be able to invoke Lemma [T2| 
For each such p, define Jp := [p ± d 2 (p, Ado)/2]. If p = me, for C £ Adg, then Xp :=Xc C Jp. We claim 




G{x)dx = 0(|Ado| Inn). 


(15) 


L 


From Lemma [T^ we get 

G{x)dx = 0(|Ado| Inn). 

* -^o\(^peviQ Jp 

Combining these two bounds, along with the observation that the union of the sets Jq \ (Upg^„ Jp) and 
Upg;vio(Jp \ ^p) i® III® ®®f Cg \ Up^Mo^pt completes the proof of (14). 

To prove (15), we show that Jj G{x)dx = O(lnn), and then sum over all p £ Adg- There are three 
cases to consider: 

(i) p = me for some normal cluster C £ Adg. Then Jp = [mc±Jic/2] and Ip =Xc = [mc±2rc]. Therefore, 
Jp \ Xp contains [me + 2rc, me + J?c/2] and [me — i?c /2, me — 2rc]. The nearest root to any x in these 
two intervals is from C. Since x is outside 2Ic, it follows that d{x,V) > \x — mc\/2. Therefore, 
G(x) := <2l\x — mc\. From Lemma j^Re), we obtain G{x)dx = OflnRc/rc). Since 

C is not a ssc, Rcirc = 0{n^), which gives us the desired bound. The same applies to the other 
interval. 
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(ii) Suppose p = me, where C e Mo is a ssc. Then Jp = [me ± Rc/‘A Tp = Tc = [me ± Re/ri^]- Let 
■= [me + rnc + be one of the intervals in Jp\Ip. The nearest root to any x £ Iq is from C. 
Since x ^ 2Ie, it follows that d{x, V)>[x — me[/2. Therefore, G{x) := < 2/[x — me[. Applying 

Lemma|ll[Re), we get jj, G{x)dx < 41nn. Similarly, for the other interval. 

(hi) p is a reaTroot then Ip = 0. For x £ Jp, our stopping function G{x) = 2/d2{x, P), i.e., corresponding 
to the inclusion predicate. Suppose q £ P is such that d 2 {p,P) = [p — q[. Then for all x £ Jp, 

2dx ^ A rP+^p/2 dx 


d 2 {x, P)>\p-q\-\p-x[> Up/2, and hence ^ = 0(1). 


Q.E.D. 


The proof above can carried out with the exact constants involved in the definitions of /c, 2c and Ae (see 
Lemma|^, but they will be absorbed by the big-0 notation. Note that the O(nlnn) bounds the number of 
calls to the Cq predicate. The specialization of G{x) for Gq is l/d(x, V). The corresponding specialization for 
Sturm sequences is l/d(x,V H K) < l/d(x,V). Therefore, 0{n\nn) holds for Newton-Isol combined with 
Sturm sequences. For Eval, one specialization of the stopping function for the Co predicate is n/d(x,V), 
which immediately gives an O(n^lnn) bound for Newton-Isol combined with Eval. Whether it can be 
improved using the more precise specialization \x-a\ remains open. 

Let P be a dense pointset P. Given a point p £ P, define Cp := min [p — q[, where q £ P \ {p}, and 
Jp := [p ± CTp/2]. We want to bound /( 2 _DpnR)\u J dx/d(x,P). We first show an 0(|Pp) bound, essentially 
following [Bur 13] . Let Vp be the set of points in 2Dp n K closer to p than to any other point in P. It is clear 
that Jp C Vp. The intervals Vp partition 2Dp n M. Then dx/d(x,P) can be shown to be bounded 

by 0{ln{r{Dp)/ap)). Using the density of P, it can be shown that if p,q are such that eXp = |p — q[ then 
P C D^p gy, which implies that r(Dp) < for allp £ P. This gives an 0(|Pp) bound instead 

of the bound in Theorem |10| To obtain that we need to amortize the integral carefully. The intuition is that 
if CTp is very small then there there must a lot of other points close to p, and hence the width of Vp cannot 
be very large compared to Up . The challenge is to get an “almost cluster-like” decomposition of P. We 
construct a tree on P that gives us this decomposition. 

We describe an iterative bottom-up procedure to construct a tree 7p with leaves from P. Let a := miupg p Op. 
For all points p £ P, draw a disc of radius a 12 centered at p. As a is the smallest distance between any 
pair of points, two such discs can at most touch each other. The discs touching each other form a connected 
component. The collection of the largest connected components partitions P (leaves are considered as com¬ 
ponents). Moreover, there is at least one component G C P that has cardinality strictly greater than one; the 
components with cardinality one are the leaves. For each such component G, we introduce an internal node 
u In. Tp with children as the leaves p, where p £ G] let G„ := G, the associated component, and cr^ := a. Now 
redefine a as the minimum separation between the components constructed so far, draw a disc of radius a 12 
centered at each p £ P, and continue as above. Let Tp be the tree constructed in this bottom-up manner; 
see FigureFurther define the following quantities for each m S Tp: 

(i) Vu as the number of children of u, 

(ii) be the center and r„ be the radius of D(Gu). 

Let u,v £ Tp he such that u is a child of u. We have the following properties of Tp. 

(PI) CTu < ininp^c„,qeP\G„ \p ~ q\ ^ The upper bound follows from the density of P. The lower bound 
follows from the observation that the discs with radius (t/ 2 centered at p S Gy, where ay < a < ay, 
do not touch the discs of any other component, except when the radius is au/2. 

(P2) r„ < [Gy[ay. Consider the graph Q with the vertices as Gy and edges between two vertices p,q if 
D{p,au/2) n D(q,aul2) ^ 0. As Gy is a connected component of these discs, we know that Q is 
connected. Therefore, if m is the number of vertices on the path joining p,q in Q, then by triangular 
inequality [p- q[ < may < |G„|tT„. 

(P3) If p is a leaf-child of u then ay = ap. It is clear that any disc D(p,r), with r < crp/2, cannot touch 
D(q, r), for any other point q. The first time they touch is when tT„ = ap. If p € C \ K, then we further 
obtain that |lin(p)| >ap> ay. 

(P4) The size of Tp — 0(|P|). Every level has a node with more than one child, as there are pairs of 
components with separation exactly a. 

(P5) P is the component associated with the root of Tp. 
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Figure 3: A dense pointset P and construction of 7p. Circles of different colors correspond to different tr’s. 
The first choice of a corresponds to blue colored circles, followed by green, orange and red. The components 
formed are shown in the corresponding colors. We only draw some of the relevant circles to give an idea. 


The next result is an amortization analogous to that of the Davenport-Mahler bound over the root 
separation bound. 

Lemma 12. If P is a dense pointset then 


dx 


/( 2 £ij,nH)\UpepJp d{x,P) 


= 0(|P|ln|P|), 


(16) 


where for p G P DM., Jp := [p ± crp/2], and Jp = % otherwise. 


Proof. We break the integral recursively over the nodes of 7p. For an internal node u of Tp, we will show 
the following claim: 

J(2D(G„)nR)\UpeG„ Jp 

We take the sum over all internal nodes u. From (P4) we know that |7p| = 0(|P|), and hence ~ 

0(|P|); moreover, from (P5) we know that the component associated with the root of 7p is P. These 
observations then give us (16). 

For a point p G P, recall that Vp is the set of points in 2Dp n K closer to p than to any other point of 
P; by definition Jp C Vp. Suppose u is the parent of p. We will bound the integral over Vp in two steps: 
the portion of Vp inside 2D{Gu) and the portion outside 2D{Gu). The latter portion is where amortization 
occurs, as for an x ^ 2D{Gu), the distance of a: to p S G„ is roughly |a; — mu\. Let u be a child of u. There 
are three cases to consider: 

Case 1. u is a leaf p G K. We first bound the portion Ip of Vp inside 2D{Gu)\ the portion outside will be 
handled collectively for all points in the third case. For all x G Ip \ Jp, it is clear that d{x, P) = |a; — p|. 

From Lemma [^Re) we obtain that /p \x-p\ ~ ^ g^g g 2Il(G„)nK, we know that 

w(Ip) < 4r„. From (P3), we know that Op = tT„. Therefore, = 0(lnr„/cT„) = 0(ln|G„|), 

from (P2). 

Case 2. u is a leaf p = Re(p) + ilm(p) G C \ M. Again consider the interval Ip := Vp H 2D{Gu)', in 
As p is the closest point to any x G Ip, d{x, P) = |a: — p|. Moreover, p and both 


this case Jp = 


the endpoints of Ip are in 2D{Gu), so the maximum distance of an endpoint of Ip from p is < 2r^ 
Therefore, from Lemma [lT](Im) we have 


dx 


d(x, P) 


= O In 


|lm(p)| 
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But recall from (P3) that |lin(p)| > cr„, hence r„/|lm(p)| < r^lou < \Gu\, where the last inequality 
follows from (P2). Therefore, j^rp) ~ OCl^ilCuD- 

Cases, u is an internal node. Inductively, we have already bounded the integral j^2D(G )nR)\u ec J dx/d{x,P). 
However, it is possible that Vp, for some point p G Gy extends beyond 2D{Gy) fl K. Suppose p is such 
a point, and x S Hp := Vp n (2D{Gu) \ 2D{Gy)). Then we know that \x — p\ > |a; — m„|/2, where 
is the center of D{Gy). Therefore, 



2dx 


t (2D{G^)\2D{Gy))m. 


As 2w{Iu) = 4r„, from Lemma [l^Re), it follows that the integral on the RHS is bounded by 
0(lnr„/ri,). But from (P2) we have < |Gu|(t„, and cr„ < 3r„ from (PI). Therefore, we obtain 



dx 

d{x, P) 


0(|ln|G„|). 


This is the case where the amortization of the integral over the Voronoi regions takes place. 
Summing the bounds for all children u of u gives Q.E.D. 

The following is the analogue of Lemma 12 in C: dehne Dp := I?(p, crp/2), then 


dz 


> ‘IDp\\JpD^ 


d{z,P) 


= 0(|P|ln|P|). 


5 Concluding Remarks 

Our aim has been to devise a general approach to improve any subdivision based algorithm for real root 
isolation. This is achieved by the Newton-Incl-Exc predicate, which detects strongly separated clusters, 
and hence reduces the number of subdivisions from 0{\ogRc/rc) to 0(nlogn). The crucial ingredient is 
Ostrowski’s criterion based on deviations of the Newton diagram of a polynomial. The criterion works for 
complex polynomials, so we expect an analogue of Newton-Isol for isolating complex roots that is concep¬ 
tually simpler than the existing approaches. We have not explored the practical aspects of the algorithm, 
nevertheless, we think that the analysis based on the geometry of cluster provides tools and techniques for 
an alternate approach to understand existing algorithms. 

We can bound the arithmetic complexity of Newton-Isol as follows. The Newton diagram computation 
takes 0(n), and the Taylor shift 0(n log n) operations. The number of Newton iterations to approximate C 
is bounded by 0(loglog'^), which is 0{log{nL)) from root separation bounds. Therefore, the arithmetic 

complexity, ignoring poly-log factors, is bounded by 0('n?). The extension to the bitstream model involves 
deriving a robust version of Ostrowski’s result and bounding precision requirements. The latter will be 
governed by perturbation bounds for clusters. For a cluster of size fc, we expect these bounds to be 0(e^/^), 
for e-perturbation in the coefficients. In the worst case, this would give an 0{n{L + logn)) bound on the 
precision. 
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Appendix 


We give the proof of Lemma the arguments are based standard manipulations with Taylor series in alpha- 
theory of Smale et al. We first prove Lemma |^i), for which we need the following functions from [BCSS98] 
defined for 


^k{z) = 




f(k){z) 


, lk{.z) = max 
f>i 


f(k+0)[z) 


(j + 


(18) 


and ak{z) = Pk{z)jk{z). We derive relations between these quantities and pk{zys given in ([^. Considering 
the RHS of Pkiz) for j = fc — 1, we immediately have 


Pk{z) > 


fk-i{z) 

fk{z) 


= k/3k{z). 


(19) 


Multiplying and dividing the inner term on the RHS of jkiz) in ( [l^ by {k + j)l/kl we obtain that 

lliclX 

j>k 

( (fc + j)! 


( {k+j)\V/^ 

')k{z) < max —-— max 

^ - 3>l \k\{j + 1)\J 3>k 


fkiz) 


= max 


1 


T>i\kl{j + 1)\J Pk+i{z)' 
The max-term is bounded by (fc -|- 1), which implies that 

'ykiz)pk+i{z) < (k + 1). 


( 20 ) 


Multiplying @ and ( [2^ we obtain that ak{z)Ak{z) < 2. Therefore, if Ak{z) > 12 then z is an approximate 
zero of with associated root in D{z, 1.5/3fc(z)) C D{z, where the inclusion follows from ([T^. 

The claim on Newton iterates follows from |BCSS98l p. 160,Thm.2]. 

We now prove Lemma i ii). We will need the following result |BCSS98l p. 161, Lem. 3]: for a u G [0,1) 




3>0 


(l-u) 


fc+l ■ 


( 21 ) 


Let e := 1.5, <5 := epk/k, and u := = k^\ here we express Pk{z) by pk (similarly, for the other quantities). 

Lemma 13. If z is such that Ak{z) > 16 then for z' G D{z, i5), we have \ fk{z')\ > |/fe(z)|(l — m)“^^+^)/2. 

Proof. Take the absolute values in the Taylor expansion of f^^^z') and apply the triangular inequality 
to obtain 

\ J>1 




5^ 


Dividing both sides by |/fc(z)|, and multiplying and dividing the summation term on the RHS by fc! and 
(fc -I- j)! we obtain that 


fk{z') 

fk{z) 


M'-i: 


l>i 


k + j 

j 


fk+j jz) 
fk{z) 


5^ 


From the expression of pk+i{z) in (|^ and definition of u, we deduce that 


fk{z') 


fkiz) 



k+j 


Using (21), and the bound on Ak, the RHS can be simplified to (1 — u) ('^+^1/2. 


Q.E.D. 
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Lemma 14. If Ak{z) > 16 then for all z' € D{z,S) we have Pk{z') < 2e^pk{z) and pk+i{z') > pk+i{z)/3. 
Therefore, Ak{z') > Afc(z)/(6e®). 

Proof. For j < k, take absolute values in the Taylor expansion of fj(z'), apply triangular inequality, and 
split the summation up to k and beyond k, to get 


i/j(^')i = i/iWi + Y 1 

i=i 


f+j 

i 


\f£+j{z)\S^ + ^ 

£>k—j 


' + j 


\h+Az)\s^. 


Divide by \fk{z)\ and use the expressions in ([^ to obtain 


fkiz) 


e=i 




£>k—j 


f + j 
i 


£-\-j — k ' 
Pk+i 


Since S = epk/k, we can pull out 


k—j 

Pk 


from the RHS (and since > 1), we get that 


fiiz') 


fk{z) 





Assuming k>2, from (21) we obtain that 

fjiz') 


fk{2 


<pk 


('-i) 


£>k—j 


- 0 + 1 ) 



Combining this bound with Lemma |13[ and doing some further simplifications we obtain the upper bound 
on Pk{z'). Note that we require k >2 > e. To derive a lower bound on pk+i{z') in terms of pk+iiz), we take 
absolute values in the Taylor expansion of fj{z'), for j > k, apply the triangular inequality, and divide both 
sides by \fk{z)\, to get 




fk{z 


< 




fk[z) 


+E 

i>i 


+j 

I 


fe+jiz) 


fkiz) 


6^. 


From the expression for pk+i in (§ and (pTj) it follows that 




fk{z) 


Pk+l 


-L+i) 


Combining this with the lower bound in Lemma [T3| and using the lower bound on A^ we further obtain that 


fjiz') 


fk{z’) 

Since u < e/{kAk) and Ak > 16, we get that 


i/O-fc) 


< 


21 


— k 


(1 - u)pk+l ' 


fjiz') 


fkiz') 


1/0-fc) 


< 


Pk+l 


which implies the desired lower bound on pk+i{z'). Q.E.D. 

To show Lemma [^iii), we suppose that pk{w) < Pk{z). As the two inclusion discs intersect it follows 
that 

\w ± 3pk{w) - z\<\z-w\+ 3pk{w) < 9pk{z) < P'^+y\ 

where the last inequality follows from Ak{z) > 27. This implies that D{w,3pk{w)) C D{z, and 

hence both the discs have the same cluster. 
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We next give a self-contained proof of Pawlowski’s result |Paw99[ Thm. 2.2], The difference in our proof 
is that we avoid using Enestrom-Kakeya theorem. The key idea of using Walsh’s representation theorem 
[R,S021 Thm. 3.4.1c], however, is common to both the proofs. 

Given a cluster C of size k, and z G C, let fi{z) = lead(/) ~ = n/3Gc('^ ~ From 

Leibniz’s formula we have 

i=msix.{0,j-\-k—n} 

We will focus on the case when j < k, in which case the upper bound of the summation is j. The bounds 
on the summation are required because fi cannot be differentiated more than k times and similarly for / 2 . 
Applying Walsh’s representation theorem, first to fi we obtain that there is an a G D(rn,rc) such that 

/W(z)= ^ f^]kik-l)---ik-i + l){z-ar-^ft'\z). 

i—maK{0^j-\-k—n} 

Now applying Walsh’s representation theorem to / 2 , we know that there is a /? ^ D(m,Rc) such that 
fii^z) = ^ k{k—l) ■ ■ ■ {k—i+l){z—a)'^~'^{n—k){n—k—l) ■ ■ ■ {n—k—j+i+l){z—P)'^~'^~^~^^. 

i—max.{0,j-\-k—n} 

Opening up the binomial (■]) and simplifying we obtain 


fjiz) = 


f^^\z) 


j! 


E 


n — k 


(z, . (z - 


J - ^ 


Pulling out the last term from the RHS we obtain that 


/.(z)= (^^.)(z-a)'=-^(z-/3r-'= 


j fk\ (n-k\ 

I z - a 

i—TS\a^{{),j-\-k—n} yj' 


note that when j = k, the term (z — a)^ ^ is one. Now we substitute i by j i to obtain 


/,(z)= y(z-a)'=-^(z-/3r-'= 


ain{j,n—k} f k \ fn—k 


E 

i=0 




Z-P 


( 22 ) 


The fraction 


^ ) _ 1 (n-k)\ j\ {k-j)l ^ 1 f{n-k)j 


< - 


i\ (n — k — i)\ (j — i)! {k — j + i)\ i\ \ {k — j) 


note that for j = k, the denominator does not appear; we capture this by using the notation (x)i := max {1, x}. 
Therefore, the summation in (22) does not vanish if z satisfies the following inequality: 


1 > ^ 1 r j{n-k) 

2- ^ 


z — a 


z-P 


Substituting the upper bound of the summation by infinity, we get the following stronger constraint: 

\ (j{n- k) 

2 - d 


2 — Z! 

i>l 


{k-j)i 


z-P 


Adding one on both sides, and using the observation that the RHS is the expansion of exp(^ |[,” ^), 

we get that the inequality follows if 


In 1.5 > - > 
~ 4 ~ 


j jn - k) 


(23) 
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Since a € D{mc,rc), we have 


\z - a\ < \z - mc \ + rc < 2max(|z - TOc|,rc), 


and 


1-2 - /3| > \mc - (i\ - \z - mc\> Rc - \z - mc\ > Rc - max{|z - mc|,rc} . 


These bounds imply that 


and hence (231 follows if 


\z - q| ^ ^ max {Iz - me I, rc} 
\z-j5\~ (i?c - max{|z - TOc|,rc})’ 


Rc > max{|z 


mc|,rc} 



8j(n- fc) \ 
{k-j)i )- 


To summarize, we have the following result: 

Lemma 15. Let C be a cluster of size k and j € {0,..., k}. If z G C is such that 


i?c > 2max{|z - mcl.rc} , (24) 

then 

f,iz)=Q{z-a)'^-^iz-f^r-Hl±l), 

for some a G D{m, rc) and /3 ^ D(m, Rc); the notation “± ” stands for a 0 G C such that |0| < 1. Moreover, 
if z also satisfies \z — mc \ > rc then f^^^z) 0. 

We specialize this result to the case of a strongly separated cluster: 


Corollary 16. For a strongly separated cluster C, if z is such that 


me I < 


R^ 


then for 0 < j < k, 

/,(z)=Q(z-a)'=-^(z-/3)"-'=(l±2), 

for some a G D(m,rc) and P ^ D{m,Rc). Moreover, if z also satisfies \z — mc\ > rc then f^^\z) 0, for 

0 < j < k. 


Proof. Note that the maximum value of the term j{n — k)/{k — j)i is obtained at j = k, and it is k{n — k). 
From the AM-GM inequality, we know that k{n — k) < n^/4. Therefore, (24) follows if z and rc are such 
that 

Rc > 4n^ max {\z — me \,rc} ■ 


For a strongly separated cluster C we know that Rc > 4n^rc, and the condition on z is the condition in the 
corollary. Q.E.D. 


We use this result to show that (fc — j) roots of the jth derivative are in D{mc,rc) and the remaining 
are outside D{mc,Rc/{‘2n‘^))- Let ai,... ,ak be the roots of / in C and let Pi,. ■. , Pn-k be the remaining 
roots. Let gt be the polynomial with roots (1 — t)mc + toi,..., (1 — t)mc + tak, Pi, ..., Pn-k- Thus go{z) = 
{z-mc)'^ Y\.pGci^~ 51 ( 2 ^) = fi^)- Since gt{z) has a strongly separated cluster of size k in D{mc,trc), 

from the lemma above we know that g[^\z) does not vanish on the boundary of D(mc,rc). As the roots 
vary continuously with t, and gQ^z) has a root of multiplicity (fc — j) at me, it follows that gt^\z) has 
k — j roots in D{mc,trc) and the remaining roots outside D{mc, Rc/{‘^n^))- Substituting t — 1 gives us the 
desired result. To summarize, we have obtained the following result: 


Lemma 17. Given a strongly separated cluster C of size k, for j < k, there are k — j roots of the derivative 
f^^\z) in D{mc,rc) and the remaining (n — k) roots are outside D{mc,Rc/{‘2-n^)). 
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